A Proficient Apprehension-Based Mining Replica
نویسنده
چکیده
Most of the frequent techniques in text mining are based on the arithmetic scrutiny of a idiom, either word or slogan. Arithmetical scrutiny of a term incidence captures the consequence of the term within a manuscript only. However, two provisos can have the similar regularity in their documents, but one term contributes more to the connotation of its sentences than the further term. Thus, the essential text mining mold should designate terms that incarcerate the semantics of text. In this case, the mining replica can incarcerate terms that current the concepts of the condemnation, which leads to innovation of the topic of the document. A novel concept-based mining replica that analyzes terms on the condemnation, document, and corpus levels is introduced. The concept-based mining replica can efficiently distinguish between non imperative terms with esteem to sentence semantics and terms which hold the concepts that symbolize the sentence connotation. The proposed mining replica consists of sentence-based impression scrutiny, document-based perception analysis, corpus-based concept-analysis, and conceptbased resemblance determine. The term which contributes to the condemnation semantics is analyzed on the condemnation, document, and quantity levels relatively than the conventional investigation of the manuscript only. The projected replica can proficiently find considerable toning concepts between documents, according to the semantics of their sentences. The comparison between documents is premeditated based on a new concept-based comparison assess. The proposed correspondence compute takes full improvement of using the perception investigation procedures on the judgment, document, and quantity levels in manipulative the comparison between documents. Large sets of experiments using the projected concept-based mining replica on unusual data sets in text clustering are conducted. The experiments express general contrast between the concept-based investigation and the habitual analysis. Tentative consequences reveal the considerable enrichment of the clustering superiority using the sentence-based, document-based, corpus-based, and united loom concept analysis. KeywordsConcept-based mining replica, sentence-based, document-based, corpus-based, concept analysis, theoretical term frequency, concept-based similarity
منابع مشابه
Rhetorical Replica (Badal Bilaqi) and Its Variants in Hafez’s Sonnets
One of the stylistic features of Hafez’s sonnets is the repetition of a part of the meaning of the first line in the second line. His knowledge of the rhetorical and semantic relations of vocabularies enabled him to repeat the meaning with the least verbal repetition. One of the ways that has helped him to achieve this goal is replicating the concepts in two parts of the couplet based on rhetor...
متن کاملAn Efficient Data Replication Strategy in Large-Scale Data Grid Environments Based on Availability and Popularity
The data grid technology, which uses the scale of the Internet to solve storage limitation for the huge amount of data, has become one of the hot research topics. Recently, data replication strategies have been widely employed in distributed environment to copy frequently accessed data in suitable sites. The primary purposes are shortening distance of file transmission and achieving files from ...
متن کاملThe Role of Communication Apprehension and Fear of Negative Evaluation in Instagram and Selfie use
Today, social networks and smart phones have become very popular. One of the interesting topics in the field of information science and cognition is the study of userschr(chr(chr(chr('39')39chr('39'))39chr(chr('39')39chr('39')))39chr(chr(chr('39')39chr('39'))39chr(chr('39')39chr('39')))) information behavior in mobile-based social networks. In this area, this study examines the role of psycholo...
متن کاملImproving Data Grids Performance by Using Modified Dynamic Hierarchical Replication Strategy
Abstract: A Data Grid connects a collection of geographically distributed computational and storage resources that enables users to share data and other resources. Data replication, a technique much discussed by Data Grid researchers in recent years creates multiple copies of file and places them in various locations to shorten file access times. In this paper, a dynamic data replication strate...
متن کاملMore Proficient vs. Less Proficient EFL Learners’ Perceptions of Teachers ‘Motivation Raising Strategies
Motivation raising strategies are frequently used in English as a Foreign Language (EFL) classes; nevertheless, learners’ perceptions of such strategies used by language teachers have not sufficiently been explored. Also, there are not enough studies on differences and similarities between more and less proficient EFL learners regarding this issue. To scrutinize this topic, a groups of more (No...
متن کامل